|
|
Accession Number |
TCMCG075C17343 |
gbkey |
CDS |
Protein Id |
XP_007029766.2 |
Location |
complement(join(33152852..33152960,33153351..33153463,33154124..33154186,33154288..33154365,33154451..33154504,33154728..33154856,33155614..33155706,33155815..33156013,33156107..33156795)) |
Gene |
LOC18599649 |
GeneID |
18599649 |
Organism |
Theobroma cacao |
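The Location value above uses the GenBank/INSDC feature-location syntax: `join(...)` lists the exon ranges in genomic coordinates, and `complement(...)` means the CDS lies on the reverse strand. The following is a minimal sketch for reading such a string and checking that the exon lengths add up to a whole number of codons. It assumes only the simple `complement(join(a..b,...))` form shown in this record; the helper name `exon_spans` is hypothetical and not part of any library.

```python
import re

# Location string copied from this record.
LOCATION = (
    "complement(join(33152852..33152960,33153351..33153463,"
    "33154124..33154186,33154288..33154365,33154451..33154504,"
    "33154728..33154856,33155614..33155706,33155815..33156013,"
    "33156107..33156795))"
)

def exon_spans(location):
    """Return (start, end) pairs for every 'start..end' range in the string.

    Hypothetical helper: it ignores fuzzy ends ('<', '>') and remote
    references, which do not occur in this record.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

spans = exon_spans(LOCATION)
# Coordinates are 1-based and inclusive, hence the +1.
cds_length = sum(end - start + 1 for start, end in spans)
on_reverse_strand = LOCATION.startswith("complement(")

print(len(spans), cds_length, cds_length // 3, on_reverse_strand)
```

For this record the nine ranges total 1527 nt, that is 509 codons, which is consistent with a 508-residue protein plus one stop codon.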
|
|
Length |
508aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007029704.2 |
|
Definition |
PREDICTED: RNA polymerase sigma factor sigA isoform X1 [Theobroma cacao] |
CDS: ATGATGGCCACAGCTGCTGTGATTGGACTTACCACTGGAAAGAGACTCTTGAGCTCTTCCTTTTCTTATTCTGATATCATAGAGAAGCTCTCATATGCCAATGATTATGGATCTTCACATCATCAGACTTCTTCCACAAAAAGTTTAATAGTTGCAAAAAAATCATCTAACTGTAGCCAAAGTCTTCCATCATCCAATCGGCGTGCTCAGTCAATTAAAGCTCTCAAAGAGCATGTCGATTCTGCCTCCATTGTTTCAACTGCAGAGCCTTTGTTTCAGGGATCCAATCACTTAGAAGTAGAAAGCTATGACCTTGACTACTCTGTGGAGGCTCTTCTTTTGCTGCAGAAGTCTATGCTGGAAAAGCAATGGACTCTTTCTTTTGAGAGGACAGTGTTCACTGAATCACCTAGTAGAAAAATTCACAAGAAGATACCTGTTACTTGTTCTGGGGTGTCTGCTCGGCAACGGAGATTCAATACAAAGAGGAAAATTCTGAGCCAAAATAAATCAATCATACAACCAAACGCTAAGCAGCTAAGATCTTTGATCAGTCCAGAGCTGCTTCAAAGTCGTTTGAAGGGTTATGTGAAGGGTGTAGTAAGTGAAGAGTTGCTCAGCCATGCAGAAGTTGTGCGCTTGTCAAAGAAAATCAAAGCTGGACTTTCCTTAGAGGAGCACAGGTTAAGATTGAAGGAGAGACTGGGATGTGAGCCTTCTGATGAACAGCTTGCAACTTCCTTGAAGATTTCTCGTGCTGAGTTACGGTCAAAGTTAATTGAATGTTCTTTGGCAAGAGAAAAATTGGCAATGAGCAATGTTCGTTTGGTTATGTCGATAGCTCAAAGATATGATAACATGGGTGCTGAAATGTCCGATCTTATTCAGGGTGGTTTGATTGGATTGTTGCGTGGCATTGAAAAATTTGATTCTTCAAAGGGATATAAGATTTCAACTTATGTGTACTGGTGGATACGTCAGGGTGTTTCTAGAGCATTAGTTGAGAACTCAAGAACATTAAGGTTGCCAACGCATTTGCATGAAAGACTGGGATTAATCCGAAATGCAAAAATTAGACTGGAAGAGAAAGGAATTACACCAACTATTGATAGGATTGCCGAGAGTCTGAACATGTCTCAGAAGAAAGTTAGGAATGCTACAGAGGCAGTCAGTAAGGTCTTCTCACTTGACAGGGATGCATTCCCCTCTTTGAATGGTCTTCCTGGAGAGACTCATCATAGTTACATTGCAGATAACCATGTAGAAAACATTCCATGGCATGGAGTAGATGAGTGGGCACTCAAGGATGAAGTGAACAGACTCATTACTATAACGCTTGGAGAACGAGAAAGAGAGATTATACGCCTTTATTACGGTCTAGATAAGGAAAGTCTTACATGGGAGGACATTAGTAAACGCATAGGTTTGTCCAGAGAGAGAGTCAGGCAAGTTGGACTTGTCGCGCTAGAGAAACTAAAACACGCAGCGAGGAAGAAGAAAATGGAGGCTATGCTAGTAAAACATTGA |
Protein: MMATAAVIGLTTGKRLLSSSFSYSDIIEKLSYANDYGSSHHQTSSTKSLIVAKKSSNCSQSLPSSNRRAQSIKALKEHVDSASIVSTAEPLFQGSNHLEVESYDLDYSVEALLLLQKSMLEKQWTLSFERTVFTESPSRKIHKKIPVTCSGVSARQRRFNTKRKILSQNKSIIQPNAKQLRSLISPELLQSRLKGYVKGVVSEELLSHAEVVRLSKKIKAGLSLEEHRLRLKERLGCEPSDEQLATSLKISRAELRSKLIECSLAREKLAMSNVRLVMSIAQRYDNMGAEMSDLIQGGLIGLLRGIEKFDSSKGYKISTYVYWWIRQGVSRALVENSRTLRLPTHLHERLGLIRNAKIRLEEKGITPTIDRIAESLNMSQKKVRNATEAVSKVFSLDRDAFPSLNGLPGETHHSYIADNHVENIPWHGVDEWALKDEVNRLITITLGEREREIIRLYYGLDKESLTWEDISKRIGLSRERVRQVGLVALEKLKHAARKKKMEAMLVKH |
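The relationship between the CDS and Protein fields above can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear-encoded plant gene but is not stated in this record; the function name `translate` is illustrative. To keep the example short, only the first 21 and last 9 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), laid out in the
# conventional TCAG order so codons and amino acids can be zipped together.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Fragments copied from the CDS field of this record.
cds_start = "ATGATGGCCACAGCTGCTGTG"  # first 7 codons
cds_end = "AAACATTGA"                # last 3 codons, including the stop

print(translate(cds_start))  # MMATAAV
print(translate(cds_end))    # KH
```

The two outputs match the first seven residues (`MMATAAV`) and the last two residues (`KH`) of the Protein field; passing the full CDS string to the same function would be the way to check the whole sequence.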